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Abstract 

A heterogeneous collection is a datatype that is capable of stor¬ 
ing data of different types, while providing operations for look-up, 
update, iteration, and others. There are various kinds of heteroge¬ 
neous collections, differing in representation, invariants, and access 
operations. We describe HList — a Haskell library for strongly 
typed heterogeneous collections including extensible records. We 
illustrate HList’s benefits in the context of type-safe database ac¬ 
cess in Haskell. The HLIST library relies on common extensions 
of Haskell 98. Our exploration raises interesting issues regarding 
Haskell’s type system, in particular, avoidance of overlapping in¬ 
stances, and reification of type equality and type unification. 

Categories and Subject Descriptors: E.2 [Data Storage Represen¬ 
tations]; D.2.13 [Software Engineering]: Reusable Software; D.3.1 
[Programming Languages]: Formal Definitions and Theory 

General Terms: Design, Languages. 

Keywords: Collections, Extensible records. Type-safe database ac¬ 
cess, Dependently typed programming. Type-indexed rows. Type 
equality. Type improvement, Haskell. 

1 Introduction 

Programmers in typed functional languages are used to homoge¬ 
neous collections, where values of the same type are stored in lists, 
sets, and others. There exist collection libraries, e.g., Edison for 
Haskell [26], Homogeneous collections rely on parametric poly¬ 
morphism. C++ programmers are also used to homogeneous col¬ 
lections such as those in the Standard Template Library, likewise for 
Ada and Eiffel. Java programmers are about to receive support for 
parametric polymorphism, finally. This may end the use of weakly 
typed collections (“Everything is of type Object!”), which require 
run-time type casts with the potential of unappreciated exceptions. 
Unfortunately, the notion of typeful homogeneous collections fails 
to work for all the scenarios that require storing values of different 
types. Here is an open-ended list of typical examples that call for 
heterogeneous collections: 

• A symbol table that is supposed to store entries of different 
types is heterogeneous. It is a finite map, where the result 
type depends on the argument value. 

• An XML element is heterogeneously typed. In fact, XML 
elements are nested collections that are constrained by regular 
expressions and the 1-ambiguity property. 


*A shorter version of this paper appeared in the proceedings of 
the ACM SIGPLAN Haskell Workshop 2004, September 22, 2004, 
Snowbird, Utah, USA, Published by ACM Press. This longer ver¬ 
sion provides several appendices and some extra paragraphs. 


• Each row returned by an SQL query is a heterogeneous map 
from column names to cells. The result of a query is a homo¬ 
geneous stream of heterogeneous rows. 

• Adding an advanced object system to a functional language 
requires heterogeneous collections of a kind that combine ex¬ 
tensible records with subtyping and an enumeration interface. 

Weakly typed encodings are feasible for all the listed scenarios. For 
instance, a heterogeneously typed symbol table can be encoded us¬ 
ing a suitably universal type, or dynamic typing, or type-safe cast. 
The present paper introduces a strong typing discipline for het¬ 
erogeneous collections. We deliver a dedicated Haskell library 
HList, which covers collection types such as lists, arrays, ex¬ 
tensible records, type-indexed products and co-products. To this 
end, we advance techniques for dependently typed programming 
in Haskell [12, 21], and we rely on Haskell 98 with common ex¬ 
tensions for multi-parameter classes and functional dependencies, 
as available in the GHC and Hugs implementations. (We manage 
to avoid overlapping instances — in the end!) Our development 
does not introduce yet another language extension, which is an im¬ 
provement over earlier proposals for extensible records and other 
collection types [10, 31, 23, 29], We explore some murky waters of 
Haskell’s type system, such as the reification of type equality and 
type unification. While we have found portable, sound and practical 
ways around, more research is needed to deliver foundational clar¬ 
ifications that enable fundamental solutions. We identify the issues 
that need to be resolved. 

The paper is structured as follows. In Sec. 2, we review weakly 
typed techniques for dealing with heterogeneous collections. In 
Sec. 3, we introduce typeful heterogeneous lists, which provide the 
basis for the HLIST library. We then work out different kinds of 
access operations and collection types: 

• Sec. 4 — numeral-based access operations, 

• Sec. 5 — labelled collections (or records), 

• Sec. 6 — type-based access operations, 

• Sec. 7 — type-indexed products. 

In Sec. 8, we demonstrate the merits of heterogeneous collections 
in the context of type-safe database access in Haskell. In Sec. 9, we 
review our take on Haskell’s type system. In Sec. 10, we discuss 
related work, and we conclude in Sec. 11. There are several appen¬ 
dices with various details. The source code from the paper can be 
retrieved from [1], 




2 Not so strongly typed collections 

We use database programming for the motivation of the HLlST li¬ 
brary in this paper. We want to get to a point where SQL queries 
can be rephrased in Haskell in a typed and structured manner. As a 
simple example, let us attempt to encapsulate a simple SQL query 
in a Haskell function. The query should retrieve all animals (their 
keys and names) of a given breed from the ‘foot-n-mouth’ database. 
A query for sheep (rather than cows) looks as follows: 

SELECT key,name FROM Animal WHERE breed = 'sheep'; 

Cheap strings 

The following Haskell code encodes the parameterised query: 

selectBreed :: String -> SqlHandle SqlQueryResult 
selectBreed b = 

sqlQuery ( "SELECT key,name FROM Animal " 

++ "WHERE breed ='" ++ b ++ 

Here we use a low-level ODBC binding for database access. The 
query is wrapped in an SqlHandle type, which encapsulates an IO 
action for an ODBC connection. The query function is parame¬ 
terised in a String for the breed parameter. The type of query 
results is defined as follows: 
type SqlQueryResult = ([ColName],[Row]) 

type ColName = String 

type Row = [Cell] 

type Cell = String 

That is, the result of a query consists of a list of column names and 
a list of rows, where a row in turn is a list of cells. Both column 
names and cells are plain strings. This is painful code in the eye of 
most programmers, but it is often a cheap way to make things work. 
Prominent database access techniques for all kinds of programming 
languages are string-based just like that. 

Hand-made universes 

If we wanted to maintain at least the primitive datatypes of cells, 
then we could replace the use of the string type with a universe of 


cell types (or a tagged union): 



data Cell = intObject 

infe. / 


I FloatObject 

Float 


I StringObject 

String 


I ... — and perhaps a few mo 

re cases 


A row is still a list of such cells, but it is effectively a heterogeneous 
list. Instead of Int and intOb ject we can use types and tags that 
are more descriptive of the columns, as below. Clearly, such an 
application-specific universe is subject to change whenever the data 
dictionary changes. 


These are the types for the columns ii 

the ‘foot-n 

mouth’ database: 

newtype Kev 

= Key 

Integer 

deriving 

(Show,Eq,Ord) 

newtype Name 

= Name 

String 

deriviMgf' 

(Show,Eq) 

data Breed 

= Cow 

I Sheep 

de®i(ii!g- 

(Show,Eq) 

newtype Price 

= Price Float 

deriving 

(Show,Eq,Ord) 

data- Disease 

= BSE 

| FM 

deriving 

(Show,Eq) 


We derive Show, Eq, and Ord instances to allow for printing of query 
results, and comparison of cells in WHERE conditions. We redefine 
Cell such that it is complete for the ‘foot-n-mouth’ database. 

data Cell = KeyCell Key 
I NameCell Name 
I BreedCell Breed 
I ... — and cerlainly more cases 


The universal universe 

Rather than introducing problem-specific universes of cell types, 
we can employ dynamics [2, 3]. Haskell’s library Data.Dynamic 
provides the type Dynamic and an injection toByn as well as a pro¬ 
jection f romDynamic. Although this approach does not seem more 
typed, at least it is more extensible: we can make each new user- 
defined type amenable to injection and projection by providing an 
instance of Haskell’s type class Typeable. There is a fully equiv¬ 
alent alternative: we can use existentially quantified cell types to¬ 
gether with a nominal, extensible, type-safe cast [19], 

Using dynamic typing, the encoding of column names in cell types 
allows us to leave them out in the type of query results. A row ends 
up being a heterogeneous list of Dynamics. That is: 
type SqlQueryResult: 'ft'- [Row] 
type Row = HList 

type HList = [Dynamic] 

Using injection we construct an HList-typed value for cow Angus: 

attqtlS - [ toByn (Key 42) 

, toDyn (Name "AaquS") 

, toDyn Cow 
, toDyn (Price 75.5) ] 

We can process such HLiSts with ordinary list-processing func¬ 
tions, e.g., head, tail, null, and foldr. We can also provide 
type-based operations, e.g., an operation hOccursMany to retrieve 
all elements of a given type: 1 
hOccursMany :: Typeable a.^'ij^st -> [a] 
hOccursMany = map fromJust — unwrap Just 

. filter isJust — remove Nothing 

. map fromDynamic — get out of Dynamic 
For instance, we can attempt to look up the breed of cow Angus: 
ghci-or-hugs> hOccursMany angus :: [Breed] 

[Cow] 

Note that printing HLiSts such as angus requires extra effort. A 
value of type Dynamic is normally opaque. We can revise toDyn 
to include a Show constraint in addition to the Typeable constraint. 
Alternatively, we can provide a Show instance for Dynamic, which 
attempts fromDynamic towards all showable types that we can pos¬ 
sibly think of. These two options account for weak extensibility. 

Too few or too many types 

Most programmers are likely to loath operating on strings: it is 
completely untyped. A non-Haskell programmer might regard 
tagged unions as reasonably typed. The Haskell programmer will 
ask for much more typing. Most notably, the above type-based 
look-up gives no static guarantee that an element of the relevant 
type will be found at run-time. 

In database programming, these guarantees correspond to static 
checks on column access in WHERE phrases and elsewhere. Static 
checks would require a mapping of the data dictionary to Haskell 
types. For example, we could define one newtype per database ta¬ 
ble, with each newtype describing table columns as a product or 
a record. We can process values of these newtypes with generic 
functions [19]. However, we are stuck: it is not enough to have 
precise types for database tables. We also would need precise types 
for queries and their intermediate expressions. So we face the need 
for an open-ended set of product or record types. This challenge is 
addressed below. 


'To avoid confusion, we prefix all heterogeneously typed func¬ 
tions, types, and classes with an “h” (or, an “H”) such as in 
•JfSIcursMany and HList. 
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3 Typeful heterogeneous lists 

We seek a notion of heterogeneous lists that is more typeful than 
[Dynamic]. The type of a list should precisely describe the types 
of its elements, as a type sequence or product. This will allow us to 
make static promises, e.g., a guarantee that a look-up operation for 
a type delivers a result. As we will see, precision of typing does not 
impair our ability to define ‘normal’ list-processing functionality. 

Heterogeneous list constructors 

We start by defining datatypes for lining up type sequences: 
data HNil = HNil deriving (Eq,Show,Read) 

data HCons es : 5fc-*?= HCons e 1 deriving (Eq, Show, Read) 
These datatypes reify normal list structure at the type level, and 
thereby they allow us to statically distinguish empty and non-empty 
lists just as in dependently typed programming [12, 21]. Further¬ 
more, each list element may have a different type. 

For less parentheses, we assume right-associative infix operators: 
type e : *: 1 = HCons el — type lets! ddnstracfcot' 
e .*. 1 = HCons el — value level constructor 

Here is a type sequence for animals: 
type Animal = 

Key :*: Name :*: Breed :*: Price :*: HNil 
Here is a heterogeneous list that represents cow Angus: 
angus :: Animal — lafeional type declaration 
angus = Key 42 

.*. Name "Angus" 

.*. Cow 
.*. Price 
.*. KN:|S;.Y ‘ 

We note that heterogeneous lists are essentially nested tuples. So 
we could use the normal type constructors () and (,) instead of 
HNil and HCons as in: (Key, (Name, (Breed, (Price, ())))). We 
favour fresh datatypes for building heterogeneous list. This helps 
avoiding confusion and clashes with ‘normal’ applications of () 
and (,). We could also consider implicitly terminated type se¬ 
quences. Again, we require a terminating HNil to avoid a mess. 

A class of heterogeneous lists 

When using HCons such as in HCons e 1, we want the tail 1 to be a 
heterogeneous list type again. To this end, we will now work out a 
class HList whose extension is the set of all proper type sequences, 
i.e., the set of all nested, right-associative, binary products. This 
class replaces the type [ Dynamic) from the previous section, 
class HList 1 
instance HList HNil 
instance H&ffft-l => HList (HCons e 1) 

What is the purpose of this class? Some readers might wonder 
whether we want to constrain the type constructor HCons like that: 

data HList HCons = HCons e X deriving . . . 

After due discussion we decided: NO, being in good company [27]. 
The problem with constraints on datatypes is that they only imply a 
proof obligation, but type inference does not propagate them nicely. 
This would lead to a proliferation of HList constraints. 

We rather place HList constraints on list-processing functionality 
whenever we want them. A user of the HList library does not 
employ the unconstrained constructor HCons, but only a constrained 
version of it. To this end, we retype (.*.): 

(.*.) :: HList 1 => e -> 1 -> HCons e 1 
(. *.) - HCons 


List-processing operations 

Functions on normal lists (e.g., head, tail, and null) can be sys¬ 
tematically transposed to the type level. Normally, each type-level 
operation is subject to a dedicated class; see App. A for some ex¬ 
amples, and the HList source distribution for additional examples. 
Let us consider the recursive function for concatenation in some 
detail. For comparison, we recall normal list concatenation: 
append :: [a] -> [a] -> [a] 
append [] = Lei. 

append (x:1) = (:) x . append 1 
We define a class HAppend for concatenation of heterogeneous lists: 
class HAppend 1 1' 1" | 1 1' -> 1" 
where hAppend :: 1 -> 1' -> 1'' 

Here we use Haskell’s extensions for multi-parameter classes and 
functional dependencies — which, incidentally, were introduced 
for the sake of ‘normal’ collection libraries. So it is not surpris¬ 
ing that we end up using these extensions for heterogeneous collec¬ 
tions. The functional dependency 1 1' -> 1" indicates that the 
class is a type-level function — rather than a mere relation on types. 
The instances follow the definition of append very closely: 
dfisfca-nce HList 1 => HAppend HNil 1 1 
where hAppend HNil = id 
/jgiSfcanfcfi (HLl#t 1, HAppend 1 1' 1'') 

=> HAppend (HCons x 1) (HCons x 1" ) 
where hAppend (HCons x 1) = HCons x . hAppend 1 
We note that append’s equational term patterns show up twice in the 
class HAppend: once in the instance heads of HAppend and once in 
its method definitions. Also, the instance constraints for HList are 
like type checks to be performed at type checking ‘run-time’. But 
otherwise we transcribe list processing to the heterogeneous case in 
a systematic manner. There is just a constant factor of noise. 

Rather than defining all kinds of specific list-processing functions, 
one might wonder if the general recursion schemes for list process¬ 
ing can also be transcribed to the heterogeneous situation. This is 
indeed the case; see App. B for a heterogeneous fold operation and 
the HLIST source distribution for further higher-order operations 
on HLists. 

Aside: stanamic lists 

The class HList, and all the classes with list-processing operations 
(e.g., the shown HAppend) are in no way restricted to lists built 
from HNil and HCons. We can easily add instances for HList, 
HAppend, and others such that we also deal with less typeful het¬ 
erogeneous lists (e.g., [Dynamic]), or with less generic heteroge¬ 
neous lists (such as hand-made universes). This allows us to use our 
collection framework even in cases when the precise type sequence 
for a collection is not known statically, e.g., when collections are 
built from user input. One can even mix statically and dynamically 
typed collections. An advanced example of such a “stanamically” 
constrained data structure are the balanced trees in [18]. For the rest 
of the paper we will focus on statically typed heterogeneous lists. 

4 Numeral-based access operations 

We will now define array-like (or numeral-based) ac< e sc per t c i 
for HLists. That is, we will use type-level naturals to address list el¬ 
ements. These access operations provide a basic layer in the HList 
library because type-based and label-based access operations can 
actually be implemented in terms of numeral-based operations. 
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class HNat n 
class HNat :|f, ■ 
class HNat 0 ' '■ 
class HNats ns 
class HNats ns 


HLookupByHNat 
HDeleteAtHNat 
HUpdateAtHNat 
HProjectByHNats 
HSplitByHNats 


ns 1 1' 
ns 11' 1" 


ns 1 -> 1' 

ns 1 -> 1' 1" 


where hLookupByHNat :: n -> 1 -> e 

where hDeleteAtHNat :: n -> 1 -> 1' 

where hUpdateAtHNat :: n -> e -> 1 -> 1' 

where hProjectByHNats :: ns -> 1 -> 1' 

where hSplitByHNats :: ns -> 1 -> fl',1") 


Figure 1. Numeral-based access operations for heterogeneous collections 


Type-level naturals 

Type-level naturals are represented by datatypes for zero and suc¬ 
cessor function. These datatypes are solely for the type-level: the 
only value of these types is 

class HNat n 

data HZero; instance HNat HZero 

data HSucc n; instance HNat n => HNat (HSucc n) 

hZero : : HZero; hZero = _L 

hSucc :: HNat n n -> HSucc nf - feSucc- . = _L 
hPred : : HNat n ?*» HSucc n -> n; hPred _ = ± 

Eventually, one needs to perform all kinds of operations on type- 
level naturals such as arithmetics or comparison. As an example, 
we present (type-level) equality, as needed elsewhere in the paper. 
First, we need type-level Booleans: 

class HBool x 

data HTrue; instance HBool HTrue 
data .HFalse;;';Sl|3ta8efe HBool HFalse 
hTrue : : HTrue; JrT'rue = _L 
hFalse : : HFalse; hFalse - 

*«*• classes, for HAnd and Hftr :dSlifted 
Here are the classes for general type-level equality and comparison 
including the straightforward instances for the equality of naturals: 
class HBool b => HEq x y b | x y -> b 

class HBool b => HLt x y b | x y -> b 

instance HEq HZero HZer& HTrue 
instance HNat n => HEq HZero (HSucc n) HFalsei 

instance HNat n =*> HEq (HSucc n) HZero HFalse 

instance (HNat n, HNat n', HEq n n' 

=> HEq (HSucc rt) (H&ice nt'J h 
— litefiSe- for HLt 

Induction on type-level naturals 

One can define various access operations using naturals as indices; 
see Fig. 1 for an overview. For instance, the delete operation boils 
down to two instances: one for HZero; another for HSucc: 
instance HDeleteAtHNat HZero (HCons e 1) 1 

where hDeleteAtHNat _ (HCor.s_1) =1 

instance (HDeleteAtHNat nil', HNat n) 

=> HDeleteAtHNat (HSucc n) (HCons e 1) (HCons e 1') 
where hDeleteAtHNat n (HCons e 1) 

=■ HCons e (hDeleteAtHNat (hPred n) 1) 

Extra constraints 

Functionality on collections carries implied constraints due to all 
the involved access operations. In addition, one might want to add 
extra constraints. For instance, we can use the following class to 
restrict the maximum length of a list (or an array): 

class HMaxLength 1 s 

instance (HLength 1 s', HLt s' (HSucc s) HTrue) 

|j§ HMaxLength 1 s 

2 We also prefix all faked dependently typed functions and types 
with an “h” (or, an “H”) such as in hTrue and HBool. These types 
correspond to subsets of values of normal types such as Int, and so 
let us discriminate the subsets of values at compile time. 


class (HList 1, HNat n) => HLength !;»§•• 1 -> n 

instance HLength HNil HZero 

instance (HLength 1 n, HNat n, HList 1) 

=> HLength (HCosd a 1) (HSucc n) 

By adding HMaxLength constraints to signatures or instances, one 
instructs Haskell to enforce size boundaries at compile time. 

5 Extensible records 

We will now define labelled collections, i.e., maps from labels to 
values. In essence, we will employ type-level naturals for labels, 
but we will enrich the structure of labels for convenience of pro¬ 
gramming with labelled collections. We end up defining exten¬ 
sible records this way, without requiring the language extensions 
of earlier proposals. From the point of view of database access, 
records provide the ultimate expressiveness for mapping column 
names to values in a typeful manner. Extensibility (and shrinkabil¬ 
ity) of records is key to dealing with types of joins and projections. 

Haskell’s nonextensible records recalled 

In Haskell 98, we can define record types like this: 

data Unpriced = Unpriced { key :: Integer 

, breed :: Breed } 

Here is a unpriced cow Angus: 

BlipricedAngus = Unpriced { key = 42 

, breed = Cow } 

What are access operations that are available for Haskell 98 
records? We can retrieve components, and we can update records 
in a point-wise fashion: 

ghci-or-hugs> breed unpricedAngus 
Cow 

:gtiei-or-hugs> unpricedAngus { breed - Sheep } 
C’npriced.{key 42, r.ar.e "Angus", breed Sheep; 

We can not extend such records (unless we were thinking of nesting 
records and using polymorphic dummy fields for extension [6]). 
Also, we can not reuse labels among different record types, neither 
can we treat labels as data; so labels are not first-class citizens. 

An extensible record demo 

We place related labels in a namespace modelled by a silly datatype: 

data FootNMouth = FootNMouth — a namespace 
Labels in a namespace are constructed in a sequence starting with 
f irstLabel, with nextLabel generating the next distinguished la¬ 
bel. Each label is also annotated with a string for the label name. 
These are the labels for animals: 
key = firstLabel FootNMouth "key" 

name = nextLabel key "name" 

breed = nextLabel name "breed" 

price = nextLabel breed "price" 













We build the record for the unpriced cow Angus as follows: 

unpricedAngus = key .=. (42::Integer) 

breed .=. Cow 
.*. emptyRecord 

That is, record construction starts from emptyRecord; the label- 
value pairs are connected by “. =. and each label-value pair is 
added by using an overloaded operation “. *. 

Extensible records are printed more or less like Haskell 98 records: 


ghci-or-hugs> unpricedAngus 

Record{key=4 2,name="Angus",breed=Cow} 

We retrieve a component from a record as follows: 


ghci-or-hugsV'pSjpricecLAngus . ! . breed 
Cow 


We can update components as follows: 

ghci-or-hugs> urtjMSicedAngus .breed . = . Sheep 
Record{key=4 2,name="Angus",breed=Sheep} 

We can really extend such records: 
ghci-or-hugs> price .=. 8.8 .*. unpricedAngus 
Record{price=8.8,key=42,name="Angus",breed=Cow} 


One possible model of extensible records 

Labels can be implemented by type-level naturals, qualified by a 
namespace, and annotated by a string for the label name: 

data HNat x => Label x ns = Label x ns Strlfsj 
f irsLLabe1 = Label hZerp 

nextLabel (Label x ns _) = Label (hSucc x) ns 
Records are maps from labels to values. We could go for hetero¬ 
geneous lists of pairs; we could also go for pairs of heterogeneous 
lists of equal length. We abstract from this choice as follows: 
class HZip x y 1 | x y ->;•©, 1 -> x y 
where hzip :: x -> y -> 1 
hUnzip :: 1 -> (x,y) 

A record is a zipped list wrapped within Record: 

newtype Record r = Record r — to be constrained 
Record construction is constrained as follows: 
mkRecord :: (HZip Is vs r, HLabelSet Is) 

=> r -> Record r 
mkRecord = Record 

For instance, the empty record is denoted as follows: 

emptyRecord = mkRecord $ hjffcj) HNil HNil 
Labels in a record must be distinct: 

class HLabelSet Is 
instance HLabelSet HNil 

instance ( HNat n, HMember (Label n ns) HFalse 
, HLabelSet Is ) 

HLabelSet (HCons (Label n ns) Is) 

To this end, we define HEq-based membership test as follows: 
class HBool b => HMember e 1 b | e 1 -> b 
instance HMember e HNil HFalse 

instance ( HEq e e' b — compare e and head e' 

, HMember e 1 b' — use of label in tail 
, HOr b b' b" — type-level OR 
) => HMember e (HCons e' 1) b" 

We also extend equality, which was already defined for type-level 
naturals, such that we can compute equality of labels. Here we 
assume that the labels in a record are in the same namespace: 
instance HEq x x' b — compare naturalSvfi|: labels? 
•=>■ HEq (Label x r.s) (Label x' ns) b 


Access operations 

In the demo, we encountered access operations for look-up, update, 
and extension. There are also operations for appending records, 
for deletion of a label and its value in a record, for renaming of a 
label in a record, for projection and splitting of a record accord¬ 
ing to a label set. We can implement these operations directly on 
the representation of records (cf. “pair of lists” vs. “list of pairs”). 
Alternatively, we can use numeral-based access complemented by 
zipping and unzipping. 

For instance, deletion . ”) can be defined as follows: 

(Record r) .-. 1 = Record r' 
where (Is,vs) = hUnzip r 

• :rr = Wind 1 Is — uses HEq pit labels 

Is' = hBeIete.ALHNat:. ft Is 

vs' = hDeleteAtHNat n vs 

r' = hzip Is' vs' 

That is, we unzip the record; we find the index n of the given label 
1 in the list Is of labels; we delete the subscripted elements in the 
lists Is and vs of labels and values; we finally re-zip the record. 

6 Type-based access operations 

Numeral-based and label-based access is in some sense still value- 
based — even though we had to reify naturals at the type level. 
We will now work out truly type-based access operations. From a 
database perspective, type-based operations are useful when types 
are descriptive of columns. In that case, there is no need to employ 
label-to-value mappings. 

As for the coding style, we will make transient use of overlapping 
instances, as supported by the GHC and Hugs implementations of 
Haskell. We later circumvent overlapping instances. 

Filter an HList for elements of a given type 

The operation hOccursMany from Sec. 2 is an example of a type- 
based operation. The type of elements to be extracted from a list 
of dynamics is specified by fixing the result type of hOccursMany. 
We will now define such type-based operations on HList including 
more strongly typed ones; see Fig. 2 for an overview. 

We dedicate a class to hOccursMany: 

class HOccursMany e 1 where 'hOccursMany :: 1 -> [e] 
The instance for HNil returns [ ]: 

ujistiaace' HOccursMany e HNil where hOccursMany _ = [ ] 

Another instance deals with a non-empty HList whose head is of 
the type of interest; notice that e is used twice in the instance head: 
Instance (HIA^h 1, HOccursMany e 1) 

=> HOccursMany e (HCons e 1) 
where hOccursMany (HCons el) = e : hOccursMany 1 
There is yet another instance for a non-empty HList whose head is 
not of the same type as the element type in hOefepTs’s result type: 
instance (HList 1, HOccursMany e 1) 

=> HOccursMany e (HCons e' 1) 
where 'kOccpfsMaiiy (HCons _ 1) “iTjQigcursMany 1 
The two HCons instances are overlapping, while the former is more 
specific than the latter, which is thereby only applied when the for¬ 
mer is not applicable, i.e., whenever the types e and e' are different. 
hOccursMany is the regular “*” operation for type-based look¬ 
up. Then there are similar operations hOccursManyl (i.e., “+”), 
hOccursOpt (i.e., “?”), and hOccursFst (for the first occurrence). 
The class HOccurs and its complement HOccursNot require more 
thought. Most notably, a type-checked application of hOccurs is 
supposed to assure that there is exactly one element of the type in 
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class HOccursMany e 1 
class HOccursManyl e 1 
class HOccursOpt e 1 
class HOccursFst e 1 
class HOccurs e 1 
class HOccursNot e 1 


where hOccursMany 
where hOccursManyl 
where hOccursOpt 
where hOccursFst 
where hOccurs 


-> [e] 

-> (e,[e]) 
-> Maybe e 


— return as many occurrences of type e as there are 

— petarn ^t least one occurrence but all again 

— rs$£ft~ihe first occurrence if any 

— return the first occurrence out of one ore more 

— establish that there is precisely one occurrence 

— constraint-only class for lack of occurrences 


Figure 2. Type-based look-up operations for heterogeneous collections 


question. Successful type checking of hOccurs angus :: Breed 
implies that angus’s breed is defined unambiguously. We will de¬ 
velop the definitions of HOccurs and HOccursNot in detail. 

Documenting potential type errors 

At first sight, there is no HOccurs instance for HNil, but we can 
provide one — be it for the sake of instructive error messages. In¬ 
stances like the following make class-based dependently typed pro¬ 
gramming more manageable: 

instance 3?$-il (TypeNotFound e) => HOccurs e. HNil 
where hOccurs = _L 

Here we use a vacuous class Fail without instances, which just 
implements what its name promises, and we also assume a datatype 
TypeNotFound that serves for nothing but an error message: 
class Fail x —»methods, fid ffi'Sfances! 

data TypeNotFound e —■ fjj? Mfuss, no operations! 
Hence we obtain somewhat suggestive error messages: 
ghci-or-hugs> hOccurs (HCons True HNil) :: Int 
No instance for (Fail (TypeNotFound Int)) 

So we try to look up a value of a type that’s not in the list. Hence, 
iteration ends up at HNil, and TypeNotFound is reported. Such doc¬ 
umentary failure instances are used throughout the HList library. 

Static look-up 

We will now provide the actual definition of hOccurs. There are 
again two overlapping instances for non-empty lists; one for the 
case that the head fits with the type of interest, and another for re¬ 
cursion in case we haven’t found an occurrence yet: 
instance (HList 1, HOccursNot e 1) 

=> HOccurs e (HCons e 1) 
where hOccurs (HCons e _) = e 
instance (HList 1, HOccurs e 1) 

HOccurs e (HCons e' 1) 
where hOccurs (HCons _ 1) = hOccurs 1 
The constraint HOccursNot e 1 in the first instance assures that 
no elements of type e occur in the tail 1. The class HOccursNot is 
for constraining only rather than actual look-up. Consequently, its 
definition does not comprise any method: 
class pOfccursNot e 1 — no methods! 

data IfjseFound e — for a failure instance 

instance HOccursNot e HNil 
instance (HList 1, HOccursNot e 1) 

-?•. HOccursNot e (HCons e' 1) 
instance Fail (TypeFound e) 

•-> HOccursNot e (HCons e 1) 

The instances fold over 1 to test that each type is different from 
e. The last instance leads to failure for an offending head. This 
failure instance is obligatory because the more general instance for 
HCons would otherwise silently skip over the offending occurrence. 
Notice that Haskell’s instance selection is solely based on syntacti¬ 
cal matching. Hence, the failure of the more specific instance (via 
Fail) will not lead to reconsideration of the more general instance. 


From look-up to projection 

We can now readily define projection by mapping over a list of re¬ 
quested element types using simple look-up for each element type; 
see the HList source distribution for the actual code. For instance, 
the following query retrieves the key and the name of cow Angus: 
•icjkci-or-f.uqs> KProject ar.gus 

:: (HCons Key (HCons Name HNil)) 

HCons (Key 42) (HCons (Name "Angus") HNil) 

This operation resembles projection in the sense of relational al¬ 
gebra, or in the sense of SQL’s SELECT statements. (Think of the 
column names following the keyword SELECT.) 

Type-based mutation operations 

We also need mutation operations such as the following: 

• Delete list elements identified by their type. 

• Update list elements by values of the same type. 

• Split a list into a projected list and its complement. 

The update operation(s) mutate at the value level only, e.g.: 

— Replace the occurrences of type e 
class HUpdateMany e 1 

where hfipdateMany :: e -> 1 -> 1 
So the type-level programming bits of look-up can be adopted for 
type-preserving update. Deletion requires functional dependencies: 

— Delete the: occurrences of type retipffl' 

Class HDeleteMany ell' e 1 -> 1’ 

where hDeleteMany :: ... — to be completed 
Such mutation operations also mutate types. Without functional 
dependencies, users had to specify the result type explicitly, which 
is impractical. The trouble is that the combination of overlapping 
instances and functional dependencies leads us into murky water. 
We take this as an incentive to identify an overlapping-free idiom. 

Passing on types as proxies 

Let us first get the type of hDeleteMany right. It could be this one: 
Class HDeleteMany ell' e 1 -> 1' 
where hDeleteMany :: e -> 1 -> 1' 

The argument of type e would merely describe the type of the ele¬ 
ments that should be deleted. We might not have any suitable value 
around (except _L). Also, the above type obscures the role of the 
first argument. So we go for this type instead: 

hDeleteMany :: Proxy e -> 1 -> 1' 

Proxies are defined as follow: 

data Proxy e; proxy : : Proxy e; proxy = _L 
Hence, the only value of a proxy type is the specific value _L of the 
constructed proxy type — not to be confused with the value _Lof 
the type being proxied. We can reduce values to proxies if needed: 

toProxy : : e -> Proxy e; toProxy _ 

For example, we delete the name of cow Angus as follows: 
§!ici-or-hugs> hDeleteMany (proxy: :Proxy Name) angus 
HCons (Key 42) (HCons Cow (HCons (Price 75.5) HNil)) 
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A non-solution 

Adopting the style that we offered for look-up operations, we would 
want to implement hDeleteMany with one instance for HNil; one 
instance for ‘delete head’; one instance for ‘keep head’: 
instance HDeleteMany e: JtSil HNil 
where hDeleteMany _j ijjsELl = HnSS, 
instance (HList 1, HDeleteMany ell') 

=> HDeleteMany e (HCons e 1) 1' 
where hDeleteMany p (HCphs _ 1) = hDeleteMany p 1 
instance (HList 1, HDeleteMany ell') 

=> HDeleteMany e (HCons e' 1) (HCons e' 1') 
where hDeleteMany p (HCons e' 1) 

» BCons e' (hDeleteMany p 1) 

Alas, the two overlapping instance heads for HCons are in no substi¬ 
tution ordering. (Neither GHC nor Hugs can be persuaded to accept 
this code.) 

Move patterns from the head to constraints 

There is a rescue. We simply need to generalise one instance head 
so that it becomes more general than the other. Then, instance selec¬ 
tion will be re-enabled. We generalise the head of the last instance: 

• before: HDeleteMany e (HCons e' 1) (HCons e' 1') 

• after: Hi&leteMany e (HCons e' 1) 1'' 

But we must maintain the type equation 1" equals HCons e' 1'! 
To this end, we employ type cast. We add an instance constraint 
TypeCast CECims e' 1') 1", and we also cast in the method: 
instance ( HList 1, HDeleteMany ell' 

, TypeCast (HCons e' 1') 1" ) 

, *=>> HDeleteMany e (HCons a' 1) 1" 

where Jgg$3$teMany p (HCons e' 1) 

= typeCast (HCons e' (hDeleteMany p 1)) 

There is no shortage of type-safe casts for Haskell [34, 8, 4, 19]. 
The one we need here is really resolved at the type-level. So there 
is no Maybe involved, since typeCast cannot fail at run-time: 
class TypeCast x y | x -> y, y -> x 
where typeCast :: x -> y 

The functional dependencies capture our expectation of type cast to 
be an isomorphism on types (in fact, the identity function). We will 
discuss the implementation of TypeCast in Sec. 9. 

Ended up in murky water 

There is no real consensus on the overlapping instance mechanism 
as soon as functional dependencies are involved. Our result from 
above fits with GHC’s model, but Hugs reports that the instances 
are inconsistent with the functional dependency for HDeleteMany. 
Here is a simple example that exercises this disagreement: 
data Foo x y 
class Bar x y | x -> y 
class Zoo x y | x -> y 
instance Zoo y r => Bar (Foo x y) r 
instance Zoo z r => Bar (Foo (Foo x y) z) r 
Hugs’ type system misses the point that Bar's second parameter is 
still functionally dependent on part of Bar' s first parameter. 

Overlapping banned 

We give up on persuading Hugs. Also, we do not want to depend 
on the doubtful future of overlapping instances in general. Further¬ 
more, regimes for instance selection differ in ways other than con¬ 
sistency criteria for functional dependencies. For instance, GHC’s 
instance selection is lazy, whereas Hugs’ is eager. 


We avoid overlapping instances by reformulating our problem into 
a case selection driven by a type-level Boolean denoting a computed 
type equality. The predicate for type equality is provided as follows: 

class HBool b => TypeEq x y b | x y -> b 

proxyEq :: TypeEq t t' b => Proxy t -> Proxy t' -> b 

proxyEq_= _L 

We take for granted that we can define type equality; see Sec. 9. 
Using type equality, we replace the overlapping instances for 
HDeleteMany by the following case-preparing instance: 
instance. ( HList 1, TypeEq e e' b 

, HDeleteManyCase bee' 11' ) 

AS: HDeleteMany e (HCons e' 1) 1' 

where .hDeleteMany p (HCons e' 1) 

= hDeleteManyCase (proxyEq p (toProxy e') ) 

That is, we compute type equality so that we are able to decide 
whether the head needs to be deleted. This decision is then imple¬ 
mented by the helper class HDeleteManyCase with instances (i.e., 
branches) for the two Booleans: 

■Class HDeleteManyCase b e e' 1 1' | b e e' 1 -> 1' 
where 

hDeleteManyCase :: b -> Proxy e -> e' -> 1 -> 1' 
instance HDeleteMany sqSif:!' 

=> HDeleteManyCase HTrue e e 1 1' 
where hDeleteManyCase _ p _ 1 =• .fififeleteMany p 1 
irfeistg-ftOe HDeleteMany ell' 

=> HDeleteManyCase HFalse e e' 1 (HCons e' 1') 
where hDeleteManyCase _ p e' 1 
= HCons e' (hDeleteMany p 1) 

This idiom works equally well for other type-based operations. 

Type-to-natural mapping 

We can even factor out case discriminations for type equality to be 
used in just a single location, namely in a type-to-natural mapping. 
The remaining type-based access operations can then employ this 
mapping completed by numeral-based access. 

The type-to-natural mapping is hosted by the following class: 

class HNat n yty H*ype2HNat e 1 n | e 1 -> n 
The implementation adopts the overlapping-free idiom: 
instance (TypeEq e' e b, HType2HNatCase b e 1 n) 
pgj. HType2HNat e (HCons e' 1) n 
class (HBool b, HNat n) 

■s> HType2HNatCase b e 1 n | b e 1 -> n 
i'Sst-anfce HOccursNot e l 

=> i:?ype2t;\’atCase HT'tue e 1 HZero 
Instance HType2HNat h 1 n 

=> HType2HNatCase HFalse e 1 (HSucc n) 

We note that the first instance carries a constraint HCf&SaiSsfiot e 1. 
This makes sure that the type e in question is associated with a 
single natural as index. Alternatively, we could return a list of a 
indexes for elements of type e. This would be necessary for the 
reconstruction of operations like hOccursMany. 

For instance, type-based delete can now be expressed concisely in 
terms of numeral-based delete — without the hassle of a helper 
class for case discrimination on Booleans: 

hDelete p 1 AliD'feleteAtHNat (hType2HNat p 1) 1 
Here we invoke the type-to-natural mapping using this function: 
hType2HNat :: HTypeZHNat e 1 n => Proxy e -> 1 -> n 
hType2HNat _ _ = ± 
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Aside: type schemas and class-based programming 

The fine details of our heterogeneous collections reflect the em¬ 
ployment of Haskell’s class concept. Most notably, all involved 
type schemas must be sufficiently instantiated to allow for instance 
selection without causing ambiguities. This is just the same as in 
the case of show . read whose application to a string cannot be 
evaluated because the type of the intermediate result is not fixed. 
We can store and look up polymorphic values as long as their type 
schemas are not needed for instance selection. So numeral-based 
access works fine even for arbitrary polymorphic elements, because 
the element types do not drive instance selection: 
ghci-or-hugs> hLookupByHNat hZero (id .*. Hftfil) $ 42 
42 

The following type-based access still works: 

ghci-or-hugs> hOccursMany (id .*. HNil) :: [Bool] 

[] 

We note that hOccursMany compares its result type with all element 
types. The type schema forall a. a -> a of id is sufficiently 
instantiated for that, i.e., forall a. a -> a is different from Bool 
for all possible a. Here is an example of an ambiguous situation: 
ghci-or-hugs> hKMiirsMany (_L HNil) :: [Bool] 

No instance f&x ... <snipped> 

The interaction of polymorphic elements in collections and class- 
based programming will continue to be a topic in the next section. 

7 Type-indexed products 

As a refinement of type-based access to heterogeneous collections, 
one can even require that a given collection is entirely type-indexed, 
i.e., that no type occurs more than once. Imposing this requirement 
on lists, we obtain so-called type-indexed products (TIPs; [31]). We 
will now briefly describe an implementation of TIPs. The dual of 
TIPs, TICs, are defined in App. C. 

We wrap TIPs in a newtype so that we make the status of being type- 
indexed explicit in type signatures. Also, we can provide special 
instances for TIPs once we made this type distinction: 

newtype TIP 1 =* TIP 1 — to be constrained 

unTIP (TIP 1) =df> ' 

The public constructor for TIPs supplies the key constraint for TIPs: 

mkTIP :: HTypelndexed 1 => 1 -> TIP 1 
mkTIP * TIP 

The class HTypelndexed is defined as follows: 
class\fiSSIst 1 ~> HTypelndexed 1 
instance HTypelndexed HNijl'v ( 
instance (HOccursNot e i, riTypelndexed 1) 

•f>. HTypelndexed (HCons e 1) 

The instances traverse over the type sequence, and the class 
HOccursNot is employed to assure that the type of the head does 
not occur (again) in the tail. 

Let us upgrade angus to a TIP: 
ghci-or-hugs> let myTipyCow = TIP angus 

Lifting operations 

Most trivially, there is a replacement for HNil: 
emptyTIP = mkTIP HNil 

Operations on TIPs are lifted as follows, “tip” is unwrapped in ar¬ 
guments, and it is wrapped in the result (if this is a TIP), while con¬ 
straints are added so that the HTypelndexed property is enforced. 
For instance: 


instance (HAppend 11' 1", HTypelndexed 1") 

=> HAppend (TIP 1) (TIP 1') (TIP 1") 

Where hAppend (TIP 1) (TIP 1') = mkTIP (hAppend 1 1') 
Likewise we overload (. * .) to work for TIPs, i.e., extensions are 
assured to preserve the TIP property. To illustrate extension, we 
label myTipyCow with BSE: 
ghci-or-hugs> BSE .*. myTipyCow 
TIP (HCons BSE ...) 

The animal myTipyCow is a cow; so it can’t be a sheep then: 
ghci-or-hugs> Sheep .*. myTipyCow 
No instance for (Faili (TypeFound Breed)) 

Subtype constraints 

TIPs naturally give rise to a subtype property. One TIP type l is a 
subtype of another TIP type l' if / contains all types from This is 
expressed as follows: 

Mass SubType 1 1' 

instance SubType (TIS 1) (TIP HNil) 

instance (HOccurs e 1, SubType (TIP 1) (Tip 1')) 

=> SubType (TIP 1) (TIP (HCons el')) 

From this it is clear that we do not care about the order of elements 
in the type-indexed products. We also note that the intersection of 
HSubType x y and HSubType y x immediately provides a faithful 
form of type equivalence for TIPs (while mere equality of the un¬ 
derlying type sequences would not be faithful). 

As an aside, we can also instantiate subtyping for records. (This 
can be used in deriving an effective object system in Haskell.) A 
record type r is a subtype of some record type r' if r contains at 
least the labels of r' , and the component types for the shared labels 
are the same. Projection according to label sets is of use here: 
.•Ift.S-'tftnCe. ( HZ ip is vs r' 

, HPtbjectBylabels is (Record r) (Record r') ) 
SubType (Record r) (Record r') 

An idiom for constraint annotation 

Let us review idiomatic support for adding extra constraints. For 
instance, let us deploy a constrained hOccurs that is meant to re¬ 
turn the Key of an animalish TIP. TIPs that are not of a subtype of 
TIP Anfttsl are to be rejected — even if they carry a Key. This can 
be encoded as follows: 

animalKey :: ( SubType 1 (TIP Animal) — extra 

, HOccurs Key 1 — implied 

) => 1 -> Key 
animalKey = hOccurs 

The trouble is that this conservative approach forces one to gather 
all the implied constraints and to make them explicit just as the ex¬ 
tra constraints. There is an idiom that allows one to solely enumer¬ 
ate extra constraints. Essentially, one defines a constrained identity 
function that imposes the constraints of interest on its argument. 

The following identity function insists on animals: 
animalish :: SubType 1 (TIP Animal) => 1 -> 1 
animalish = id 

We can now discipline the Key getter as follows: 

-^ftirt#lKey 1 = hOccttrs (animalish 1) : : Key 
The subtype constraint takes action as one can see here: 
qhci*=or-hugs> animalKey myTipyCow 
Key 42 

ghci-or-hugs> animalKey (Key 42 .*. emptyTIP) 

No instances for (Fail (TypeNotFound Price), 

Fail (TypeNotFound Breed), 

Fail (TypeNotFound Name)) 







The error message lists the types that are missing from Animal. 


A polymorphism benchmark 

As proposed by a reviewer of this paper, we will now consider an 
example from [31], which is, in a way, about type-based matching. 
The following function selects two elements from a collection: 

tuple 1 = let x = hOccurs 1 

1' = hDeleteAtProxy (toProxy x) 1 
y = hOccurs 1' 
in (x,y) 

The following session shows that we can match the elements of 
a collection in whatever order, while the overloaded operations in 
tuple are resolved by the consumers of the matched values: 


ghci-or-hugs> 

ghci-or-hugs> 

ghci-or-hugs> 

ghci-or-hugs> 

ghci-or-hugs> 

ghci-or-hugs> 

(2,False) 

ghci-or-hugs> 

(False,2) 


let one = (1::Int) 

let inc x = x + one 

let incNot (a,b) = (ijj.C a, not b) 

let notlnc (a,b) = (not b,inc a) 

let oneTrue = (toys .*. True .*. HNil 

incNot (tuple pnajttue) 

notlnc (tuple %pTrue) 


The following example should arguably work, but it doesn’t: 

ghci-or-hugs> inc $ fst (tuple oneTrue) 

No instances for ... <snipped> 


We are going to make this work as well! We note that oneTrue 
stores two components; so by fixing the type of one component to 
Int, it should not matter that the type of the other component is left 
unspecified. The problem boils down to the following issue: 


ghci-or-hugs> hOspiurs (HCons True iffiil) 

No instance for (HOccurs e (HCons Bool HNil)) 


We would like to default e to Bool here. Rather than comparing 
the type of the head with a not yet instantiated result type, the two 
types should be unified. The hOccurs operation for TIPs does this: 


ghci-or-hugs> 

True 

ghci-or-hugs> 

ghci-or-hugs> 


hOccurs (True .*. emptyTIP) 

let oneTrue = one .*. True .*. emptyTIP 
inc $ fst (tuple oneTrue) 


Even the following added polymorphism is handled: 

ghci-or-hugs> let qfieMull*• [] emptyTIP 
ghci-or-hugs> inc $ fst (tuple oneNull) 


The key idea is to provide a special instance for singleton lists, and 
to replace the test for type equality by unification via type cast: 
instance TypeCast e' e 

A* HOccurs e ffiP (HCons e' HNil)) 
where hOccurs (TIP (HCons e' _)) «*•-typeCast e' 
instance HOccurs e (HCons x (HCons y 1))' 

=> HOccurs e (TIP (HCons x (HCons y 1))) 
where(TIP 1) = hOccurs 1 
This example reveals that type cast provides a powerful idiom for 
type improvement — a more fine-grained one than functional de¬ 
pendencies. That is, type cast operates at the instance level as op¬ 
posed to the class level! 


8 Database programming 

We will now demonstrate heterogeneous collections for database 
programming in Haskell. To this end, we adopt concepts from Lei- 
jen and Meijer’s embedding approach for SQL [20]. We employ 
extensible records for two purposes: 


• to represent the results of queries, and 

• to represent schemas for relational algebra operations. 

A detailed discussion of the approach is beyond the scope of this pa¬ 
per. We note however that the approach scales to the full relational 
algebra, and to a rich set of SQL idioms including all kinds of joins, 
existential quantification, nested queries, and table statements. 

We recall the simple query from the beginning of the paper: 

SELECT key,name FROM WHERE breed = 'sheep'; 

In Haskell, we can now write this query in a type-safe manner. 
selectBreed b = — argument $ for the breed 
do rl <- table aniffialTable 

M <- restrict rl (\r -> r .!. breed 'SQL.eq' b) 
r3 <- project r2 (key .*. name .*. HNil) 
doSelect r3 


Type inference works fine, but here is the type of the query anyway: 
selectBreed :: Breed -> Query [ 

Tkey :=: Animalld 

Tname :=: String HNil ] 

That is, the result is a query for records with two components. (The 
types for the labels key and name are denoted by Tkey and Tname.) 
The above do sequence encodes the SQL query in four steps: 

• rl: We identify the table as in “FROM Animal”. 

• r2: We restrict the table according to the WHERE condition. 

• r3: We perform projection as in “SELECT key, name”. 

• doSelect r3: The actual query is issued. 

Steps 1-3 do not involve any database access. (Monadic style is 
used for hygienic name supply.) The operations table, restrict, 
project create or modify type-annotated, syntactical expressions 
for relations. The underlying key data structure looks as follows: 


data Relation schema — type sanotatloh layer 
= Relation schema Sq! Relation 


data SqlRelation 
= SqlRelatiaii;. 
rTag 
rSource 
rRestrSg|List 
rPtojeetList 
rGrpupList 
rOrderfciftSt 


— expression layer 


SqlTag, 

SqlSource, 
[SqlExpresslonj,; 
[SqlExpression]-, - 
[SqlExpression], 
[SqlExpression] } 


That is, relations carry a schema, and their structural ingredients 
comprise a unique tag, a source (i.e., a database table), as well as 
lists of expressions describing restrictions (cf. WHERE), projections 
(including computed columns), grouping and ordering. 


The type of the relational schema for animals is the following: 


type AnimsfliSbtoRS. = 

Tkey :=: Attribute'Animalld Sqllnteger 
Tname :=: Attribute String SqlVarchar 
Tbreed :=: Attribute Breed Sqlvarpha# 
Tprice : = : Attribute Float SqlNumeri.tj 
ifarm : = : Attribute Farmld Sqllnteger. 


HNiJ 


The schema type lists both the domain of a column and the cor¬ 
responding SQL type. Lor instance, the Haskell type for the key 
component is the newtype Animalld rather than the SQL type 
Sqllnteger. This ‘domain as newtypes’ technique increases type 
safety: one cannot possibly confuse an Aiiimpjd and a Farmld. 
We note that some of the column types could be wrapped in Maybe, 
but this is not the case for AnimalSchema. 


The datatype Attribute is a phantom type in its two type 
parameters. These phantoms drive coercions and make at¬ 
tribute access type-safe. Lor instance, consider the subexpression 
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r . ! . breed 'SQL.eq' b for restriction in the above query. The 
look-up r . ! . breed does not just establish that there is a breed 
component, but it also delivers a phantom-typed attribute, so that 
its use in the compound expression is type-constrained. 
Structurally, attributes keep track of some details such as precision, 
and NULL constraints. All such information is extracted from the 
data dictionary of a database. 

Here is a snippet of the extracted table description for animals: 

animalTable :: Table AnimalSchema 
animalTable = mkTable ( 

key .=. Attribute { ... } .*. 

name .=. Attribute { ... } .*. 

... HNil ) 

This is all what’s needed to make attribute access type-safe. Re¬ 
turning typed query results relies on further provisions. That is, the 
action doSelect for executing a query has to recast query results 
such that they are phrased in the Haskell types for column domains. 
The code for the execution of SELECTS makes it all clear: 
doSelect (Relation schema rel) = do 
sqlDo (showSqlRelation re.:} 
rows <- getSqlRows 
return $ map ( labelRtist, labels 
. readHList values 

where (labels, values) .Slftzip schema 
The subexpression showSqlRelation rel computes the SELECT 
statement as a string, which is then given to sqlDo — the low-level, 
ODBC-based SQL handler. In the next step, we get all the queried 
rows as a lazy list of lists using this SQL service: 

qetSq.l Rows : : SqlHandle [ [Maybe String] ] 

The subsequent map transforms the string-based rows into typeful 
ones in two steps. Firstly, we build an HList from the strings with 
readHList, while we use the attributes from the schema to drive 
this heterogeneous list construction. Secondly, we turn the HList 
into a record, while we reuse the labels of the schema. 

9 By chance or by design? 

We will now discuss the issues surrounding the definition of type 
equality, inequality, and unification — and give implementations 
differing in simplicity, genericity, and portability. 

We define the class TypeEq x y b for type equality. The class re¬ 
lates two types x and y to the type HTrue la case the two types 
are equal; otherwise, the types are related to HFalse. We should 
point out however groundness issues. If TypeEq is to return HTrue, 
the types must be ground; TypeEq can return HFalse even for un¬ 
ground types, provided they are instantiated enough to determine 
that they are not equal. So, TypeEq is total for ground types, and 
partial for unground types. We also define the class TypeCast x 
y: a constraint that holds only if the two types x and y are unifi- 
able. Regarding groundness of x and y, the class TypeCast is less 
restricted than TypeEq. That is, TypeCast x y succeeds even for 
unground types x and y in case they can be made equal through 
unification. TypeEq and TypeCast are related to each other as fol¬ 
lows. Whenever TypeEq succeeds with HTrue, TypeCast succeeds 
as well. Whenever TypeEq succeeds with HFalse, TypeCast fails. 
But for unground types, when TypeCast succeeds, TypeEq might 
fail. So the two complement each other for unground types. Also, 
TypeEq is a partial predicate, while TypeCast is a relation. That’s 
why both are useful. 

A representation-based equality predicate 

The predicate TypeEq x y b was introduced in Sec. 6 as follows: 


class HBool b => TypeEq x y b I x y -> b 

We now need to provide instances of the class. A very naive im¬ 
plementation would be to explore all combinations of all possible 
types; see the HList source distribution for an illustration. Albeit 
being portable (Haskell 98 + multi-parameter classes), this leads to 
an impractical, exponential explosion in the number of instances. 
A more scalable approach is to introduce a family of infinite types 
for type-level type representations. That is, we associate types with 
type representations via a bijection, and we make sure that type rep¬ 
resentations are more easily compared than the types themselves. 
We already have all tools for constructing the family of type rep¬ 
resentations; we can associate with each type constructor an HNat, 
and associate with each type term an HList of the representations 
for the type constructor and its arguments. For instance, using ‘0’ 
for Bool, ‘1’ for Int, ‘2’ for ->, we obtain: 
class TTypeable a b | a-> b 
instance TTypeable Bool (HCons HZero HNil) 

.instance TTypeable Sift (HCons (ESucc HZero) HNil) 
ibsba-nc'e (TTypeable a TTypeable b bl) 

=> TTypeable (a->b) (HCons (HSucc (HSucc HZero))' 
(HCons al (HCons bl HNil))) 

Because these type representations are constructed in a regular way 
with ever-increasing naturals, it is sufficient to accommodate type- 
level equality such that it can compare heterogeneous lists of type- 
level naturals. Type-level equality for naturals was given in Sec. 4. 
Here are the remaining instances for HNil and HCons: 
instance HEq HNil HNiJ HTrue 
instance HList !, =>> HEq HNil (HCons Wi) HFalse 
instance HList §,•#• HEq (HCons e 1) HNil HFalse 
'Instance ( HList l r HList 1' 

, HEq e e' b, HEq 1 1' b', HAnd b b' b'' 

) => HEq (HCons e 1) (HCons e' 1') b" 

All the involved functionality does not go beyond Haskell 98 and 
multi-parameter classes with uni-directional functional dependen¬ 
cies. GHC and Hugs readily support this combination. 

We can now define the class TypeEq, using the following instance: 
ahstance ( TTypeable t tt, TTypeable t' tt' 

, HEq tt tt' b ) TypeEq t t' b 
We make use of a generic instance, which is a common Haskell 98 
extension. In turns out that we have essentially transposed what’s 
known as the Data. Typeable approach [19] to the type level. We 
share the drawback of this approach: we need to define an instance 
of TTypeable for each new type constructor. When adding new in¬ 
stances, we have to maintain the bijection between types and type 
representations. On the other hand, the remaining code is fully 
generic and does not need to be amended at all. 

A generic type equality predicate 

We have seen that we can implement TypeEq in a portable and even 
practically usable way, using only commonly supported Haskell 
extensions. We would like to introduce a fully generic approach, 
which does not need to be amended when a new type constructor 
is introduced. Alas, this elegant approach leads us out of the safe 
haven into uncharted waters of experimental extensions. 

The most concise implementation reuses the overlapping tricks that 
were discussed in Sec. 6, which makes the solution GHC-specific: 
TypeEq x x HXnje 

Instance (HBool b, TypeCast HFalse b) 

TypeEq x y b 

Here we take advantage of TypeCast, which we define next. 
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Reification of type unification 

The class TypeCast was introduced in Sec. 6 and further em¬ 
ployed in Sec. 7. TypeCast x y differs from just type equality 
TypeEq x ff jljrue as follows. If TypeCast x y succeeds, then 
the two types are unified. The difference between unification and 
just equality emerges when the types are not grounded, i.e., when 
they contain uninstantiated type variables. The types [a] (e.g., of 
the polymorphic constant [ ]) and [Bool] are unifiable, but they are 
not equal. TypeEq cannot establish equality for ungrounded types; 
however it can establish disequality in case the schemas are suffi¬ 
ciently instantiated to determine that they are not equal. 

The most generic implementation of TypeCast, which works for 
both Hugs and GHC, is as follows: 

instance TypeCast x x where typecast = id 
For this implementation to work, we need to import it at a higher 
level in the module hierarchy than all clients of the class TypeCast. 
Otherwise, type simplification will turn constraints of the form 
TypeCast x y into the form TypeCast x x, and thereby inline the 
unification. We refer to App. D, where we give another implemen¬ 
tation of TypeCast, which does not require separate compilation. 
This time, we effectively delay the simplification step with the help 
of two auxiliary classes. It seems that this delay of type simplifica¬ 
tion is at the core of all attempts at type-safe cast or type equality 
(e.g., [4]). 

A specific property of our TypeCast is that it allows us to con¬ 
trol type improvement on a per-instance basis, as the polymorphism 
benchmark for TIPs showed in Sec. 7. So the utility of TypeCast 
goes strictly beyond a generic implementation of TypeEq. 

10 Related work 
Heterogeneous lists 

Type-level list-processing is a relatively obvious opportunity once 
we get hold on faked dependently typed programming in Haskell, as 
pioneered by Hallgren and McBride [12, 21]. For instance, homo¬ 
geneous type-level vectors are considered in [21], The idea of het¬ 
erogeneous type-level constructors (what we call HNil and HCons) 
occurs elsewhere in the literature. In App. H of [9], Duck et al. mo¬ 
tivate their CHR-based model of functional dependencies by op¬ 
erating on such lists using numeral-based access (similar to our’s 
in Sec. 4); Sulzmann also gives a related implementation in the 
Haskell-style language Chameleon [33]. In [22,23], Neubauer et al. 
motivate Haskell extensions for a functional notation of functional 
dependencies, and for functional logic overloading. The authors 
consider examples like type-level functions append and length, as 
well as record-like operations. By contrast, our goal was to explore 
the various kinds of access operations for heterogeneous collec¬ 
tions: list processing, numeral-based, label-based, and type-based 
operations. HLlST is the first heterogeneous collection library to 
the best of our knowledge. 

Type-indexed rows 

Shield and Meijer have studied the type theory of extensible records 
and variants starting from a more basic principle, namely type- 
indexed rows (TIRs) [31]. A TIR is nothing but a type expression 
that enumerates types. This resembles HLists, but TIRs do not 
comprise any values. So we could go for constructor-less datatypes: 
class IIK r 

data Empty; instance TIR Empty 

data e :#: r; instance TIR r => TIR (e :#: r) 

A TIR is well-formed if the enumerated types are distinct. Well- 
formedness corresponds to our HTypelndexed constraint. Shield 


and Meijer provide type-level operators ALL and ONE that, given 
a TIR, derive types for type-indexed products (TIPs; recall Sec. 7) 
and type-indexed co-products (TICs; see App. C for the HList im¬ 
plementation of TICs). We could redefine our datatypes for TIPs 
and TICs such that they take a TIR as parameter, but these def¬ 
initions and their usage would be more complicated in Haskell. 
Shield and Meijer argue that, conceptually, a newtype-like mecha¬ 
nism is sufficient for labelling. Our development provides labels as 
first-class citizens, and we can provide labelled collections without 
reference to general type-indexing (i.e., numeral indexing is suffi¬ 
cient). Our Haskell-based reconstruction of TIPs and TICs does not 
require new language extensions. 

Extensible records 

Foundations of extensible records have been studied intensively. 
Several Haskell language extensions have been proposed [10, 31, 
29], alike for other languages, e.g., (S)ML [6, 30]. There are also 
record calculi by Bracha, Ohori and others [5, 24], There are re¬ 
lated type systems, e.g., for relational algebra [14]. We have shown 
that we can reconstruct extensible records in Haskell starting from 
simpler notions; in particular: heterogeneous lists and equality and 
of type-level naturals. We cover all typical record operations. We 
have also defined subtyping constraints in our framework. 

Labels, values and records are all first-class citizens in HList. So 
we can write abstractions that take and produce entities of all these 
kinds. For instance, here is an operation to rename a record label: 
hRenameLabel 11' r = r'' where 

v = r |;» ! — look up by label- 

r' = r _ -** delete at label 

r". = . v .*. r' — add new label, old value 

Type equality and type cast 

In our development of heterogeneous collections, we rely on ob¬ 
servability of type equality. Also, we employed a reified type 
unification (‘type-level type cast) in a few places. Related ex¬ 
pressiveness has been studied in the context of intensional poly¬ 
morphism [13], dynamic typing [2, 3], and universal representa¬ 
tions [36]. Some more recent Haskell-biased work on these no¬ 
tions [34, 8, 4] is not directly usable for our purposes. These ap¬ 
proaches either require the programmer to use type representations, 
or they make a closed-world assumption with regard to the cov¬ 
ered types, or they are focused on sums-of-products (as opposed 
to the immediate coverage of Haskell’s newtypes and datatypes), 
or they involve existential quantification (which makes it difficult 
to perform more arbitrary operations on elements in the collec¬ 
tions). Most notably, we require a type cast that is resolved at type¬ 
checking time; run-time would be too late. 

Haskell’s type classes 

Multi-parameter classes [7, 15, 16, 28] with functional dependen¬ 
cies [17, 9] are crucial for type-level programming in Haskell. 
These typing notions are reasonably understood. There is an on¬ 
going debate if instance selection should be programmable by us¬ 
ing constraint-handling rules or functional logic evaluation [32, 23], 
Also, the mere notation for encoding type-level functions could per¬ 
haps be improved [22]. We have considered using overlapping in¬ 
stances for the definition of some access operations, but ultimately 
we eliminated use of this debated extension in a systematic manner. 

Statically enforced invariants 

The TIP newtype is an example of a data structure with a stati¬ 
cally checked invariant (i.e., uniqueness). Okasaki and others have 
worked on statically assuring invariants of complex data types, e.g.. 




that a matrix is square [25], These examples normally rely on clev¬ 
erly chosen data constructors, which make it impossible to con¬ 
struct “wrong” data structures. Our approach is different: type 
classes let us impose static constraints irrespective of data construc¬ 
tors. Indeed, we use the same data constructor HCons to build het¬ 
erogeneous lists with and without duplicates. We express the con¬ 
straints in types (sometimes, in phantom types). Our approach does 
not require extraordinary cleverness in the design of data represen¬ 
tation. Furthermore, in the case of constraints encoded in phantom 
types, there is no run-time or -space overhead of storing and travers¬ 
ing chains of data constructors (TIP is just as efficient as HList). 
Because TIP is essentially HList, we were able to trivially lift all 
list-processing functions to TIPs. Statically checking complex in¬ 
variants on data structures, such as well-formedness of red-black 
trees and size-boundaries of lists, is a known application of depen¬ 
dency typed programming [35]. The latter requires non-trivial ex¬ 
tensions to a programming language. We have shown that certain 
invariants, e.g., size boundaries for HLists, or uniqueness in TIPs, 
can be statically expressed in Haskell’s type system already. 

11 Conclusion 

We have systematically developed a Haskell library over strongly- 
typed data structures for heterogeneous collections — lists, arrays, 
extensible records, and others. The composition of such a data 
structure, e.g., the types of all elements, is manifest in its type. This 
makes it possible to strongly type the operations on collections, e.g., 
look-ups, updates, insertions, and projections. The name of the 
library, HList, emphasises that all data structures are built from 
typeful heterogeneous lists. We have defined restricted collections, 
e.g., TIPs, constrained by the requirement that no two elements may 
have the same type. The constraints are again manifest in the type 
of the collections and are enforced by the type checker. 

The immediate application of our HList library is a database ac¬ 
cess library that covers SQL92, returns the query results as a stream 
of records, and statically checks that all the queries are consistent 
with the database schema. 

The implications of the library HLIST turn out far reaching, and are 
still under active investigation. Our TIPs and records are extensible 
and offer subtyping polymorphism. Our records have first-class la¬ 
bels that can be reused across several record types. We notice that 
HLIST is implemented in Haskell with only common extensions. 
Hence the HList library addresses the challenge for better Haskell 
records, without breaking existing programs, as articulated by Si¬ 
mon Peyton Jones at the Haskell Workshop 2003 [11], Our records 
also let us implement hasllacks, record concatenation, length vs. 
depth subtyping. We can now experiment with these features in 
real programs — again, without requiring any language extension. 
Extensible TIPs and records can be the foundation of the gen¬ 
uine object system. The latter offers subtyping polymorphism (cf. 
OCaml) as opposed to the class-bounded polymorphism of Haskell. 
It is remarkable that type classes themselves were instrumental in 
implementing open TIPs. Extensible records can also be elabo¬ 
rated to provide strongly typed keyword arguments with reusable 
labels. That is, function arguments can be addressed by keywords, 
and these arguments can be optional or mandatory. The HLIST 
source distribution demonstrates keyword arguments. Dual to TIPs 
are open TICs, offering us dynamics with a statically-checkable 
constraint on the sort of types encapsulated in the dynamic enve¬ 
lope (cf. App. C). The lists, TIPs, TICs and records of the HList 
library can also be employed in typeful foreign-function interfaces 
and in XML processing. 

Our code relies on the most common Haskell extensions; the use of 
overlapping instances can be circumvented. In fact, a generic im¬ 


plementation of the predicate TypeEq for type equality would still 
rely on overlapping in a single location. We can also implement 
TypeEq in a portable but non-generic manner relying on one in¬ 
stance per user-defined datatype. Our development suggests that a 
fundamental solution could be to offer type equality as a primitive 
in Haskell. We have also identified the utility of reified type unifi¬ 
cation (or ‘type-level type cast’) as a tool for type improvement — 
more fine-grained than functional dependencies. More research is 
needed to deliver foundational clarifications. 
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A Some trivial list-processing operators 

We will now transpose several normal list-processing operators to 
the heterogeneous situation. 

Transposition of head and tail 

class HList r => HHead r a | r -> a 
where hHead :: r -> a 
instance Pail HListEmpty => HHead HNil () 
where hHead _ = () 

instance HList r => HHead (HCons a r) a 
where hHead (HCons a _) = a 
Class (HList r,HList r') -Srlfsijl r r' | r ->/#' 
wherfe.-Krail : : r -> r'f" 
instance Fail HListEmpty -> HSgSl HNil HNig/ 
where Mail _ = HNil 
instance HList r =* j!jail (HCons a r) r 
where..Kfiil (HCons _ r) iStpf? 

In the above instances, we use the same technique for error mes¬ 
saging as explained in Sec. 6. That is, we employ the Fail class to 
handle invalid applications of the operations. In particular, there is 
an error message HListEmpty, whenever we attempt to access an 
empty list where a nonempty list is needed. Thus, we have: 

.class Fail i — no instances! 

data HListEmpty — no nuetwe! 

Transposition of null 

Class Jpjb^l b => HNull lb | 1 -> b 
..instance ffl&ll HNilujJTrue 
■instance HNull (HCons e 1) HFalse 

Transposition of length 

class HNat rt => HLength 1 n | 1 -> 

-instance HLength HNil HZero 
‘J^tStqnce HLength 1 n 

=> HLength (HCons e 1) (KSuec n) 
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B A heterogeneously typed fold operator 

We go for the fold operation because it is the ultimate example of a 
higher-order list-processing function. We dedicate a class HFoldr 
to right-associative folding. The HFoldr instances will lift the 
defining equations for f oldr to the class level: 

class HList 1 => HFoldr f v 1 r | f v .1 -> r 
where 

hFoldr : : f ->:• '%■ -> 1 
The instance for empty lists is trivial: 
instance HFoldr £ f HNil v 
where 

hFoldr _ v _ = v 

The instance for HCpaa follows the normal foldr again, but we 
assume that function application is modelled by an extra class 
HApply. This allows us to use hFoldr for functions that require 
specific constraints on the involved types: 

instance (HFoldr f Sr 1 r, HApply f (e,r) r') 

-•> HFoldr f v (HCons e 1) r' 

hFoldr (HCons e 1) = hApply f £ t 1) 

The class HApply resembles function application, indeed: 
class HApply f | f a -> r 
where 

hApply :: f -> a -> r 

For instance, we can now redefine hAppend in terms of hFoldr: 

hAppend 11'= hFoldr ApplyHCons 1' " 

The datatype ApplyHCons stands for “application of HCons”: 
data ApplyHCons = 

ApplyHCons — a proxy for instance selection 
This meaning of ApplyHCons is registered as an HApply instance: 
instance HApply ApplyHCons (e,l) (HCons e 1) 
where 

hApply _ (e,l) = HCons e 1 

C Type-indexed co-products 

We will now dualise TIPs to arrive at so-called type-indexed sums 
(or co-products; TICs). A TIC-typed data structure holds a datum of 
one out of a fixed collection of types. So at the value level, a TIC- 
typed data structure is not really a collection, but just one datum. 
However, at the type level we use a list of type proxies to maintain 
the valid element types of a specific TIC type, and thereby we can 
restrict construction and destruction of TIC-like data structures. 

A TIC demo 

We first define an actual TIC type, namely one that models various 
element types for collections related to the animals in the ‘foot-n- 
mouth’ database: 

type Ajji.ns(i6oi ^ 

Key :+: Name :+: Breed :+: Price :+: HNil 
Here we use “: +: ” rather than “: * : ” to point out that we are inter¬ 
ested in a type-indexed co-product rather than a product. We can 
now construct actual TIC-like data. For instance: 

ghci-or-hugs> let myCol = mkTIC Cow : : TIC : 'SititnalCol 
We can also destruct myCol. If we ask for the ‘right’ type, then 
destruction succeeds with a result of the form Just ...; otherwise 
we obtain Nothing: 

ghci-or-hugs> unTIC myCol :: Maybe Breed 
Just Cow 

ghci-or-hugs> unTIC myCol :: Maybe Price 
Nothing 


Most notably, TICs restrict destruction with regard to static typing: 

:qhci-or-hugs> unTIC myCol : : Maybe 
Type error ... 

Sequences of type proxies 

We used the alias “: +: ” above to enumerate the summands of a TIC 
type. In fact, “: +: ” is constructed such that it lines up proxy types 
in a sequence. Value types would be misleading and confusing here 
because the sequence of summands is meant for nothing but listing 
‘options’. So the alias is defined as follows: 
type e : + : 1 = HCons (Proxy e) 1, 

The actual property of a type sequences to consist only of proxy 
types is easily specified. 

Glass HTypePr«Mt|,ed 1 
instance HTypeProxied HNil 
instance HTypeProxied - 

=> HTypeProxied (HCons (Proxy e) 1) 

TICs as constrained dynamics 

The demo suggests that a TIC is more constrained than the type 
Dynamic. So in turn, one can define more constrained collection 
types than just [Dynamic] or String -> Byba^ie, There exist 
different implementations of TICs, but we will favour here one that 
indeed directly employs Haskell’s dynamics at the value level. 

A TIC type is then of the following form: 

data S.SS: 1 = ®SC Dynamic — to be constrained 
The phantom type parameter 1 of TIC enumerates the admitted 
types that can be injected into this TIC, and that can be subject to 
extraction attempts. The public constructor for TICs (aka injection) 
lists all the necessary constraints: 
mkTIC :: ( HTypelndexed 1 
, HTypeProiJti'fed 1 
, HOccurs (Proxy i) 'j|- 
, Typeable i 
) 

=> i .=> Tip 1 
mkTIC i = TIC (toDyn i) 

The HTypelndexed and HTypeProxied constraints require that 1 is 
a type-indexed sequences of type proxies. The HOccurs constraint 
ensures that the proxy type of the injected value i is covered by the 
sequence of proxies 1. Finally, the Typeable constraint allows us 
to use Haskell’s module Data. Dynamic. 

It remains to define destruction (or projection), which happens to 
simply invert the constrained value-to-dynamics conversion: 

UnTIC :: ( HTypelndexed 1 
, HTypeProxied 1 
, HOccurs (Proxy o) 1 
, Typeable o 
) 

=> TIC 1 -> Maybe o 
■jjtiTfiS (Tf® i) = fromDynamie i 


14 







D Generic type unification cont’d 

The class TypeCast was described in the subsection ’Reification of 
type unification’ of Sec. 9. 



That section showed the most straightforward implementation of 
that class: a single instance TypeCast x x with the method 
typeCast being just the identity. However, that simple implemen¬ 
tation was difficult to use. Separate compilation had to be put to use 
in some tricky way. Indeed, recall the following example of using 
TypeCast from Sec. 7: 
instance TypeCast e' e 

HOccurs e (TIP (HCons e' HNil)) 
where hOccurs (tl# (HCons e' _)) = typeCast e' 
When the compiler sees the instance TypeCast x x and combines 
that with the functional dependencies a->b, b->a of the class, the 
compiler infers that the two parameters of TypeCast must be the 
same. That conclusion is correct — the type cast is meant to be an 
isomorphism on types (in fact, the identity function). What is trou¬ 
blesome is that the type checker applies that conclusion — as a type 
simplification rule — to the HOccurs instance above and infers that 
e must be e'. That is a problem however: if a type signature con¬ 
tains distinct type variables, one should be able to instantiate them, 
at least in principle, with distinct types. Otherwise, the inferred 
type is less polymorphic than the explicit signature prescribes. 

This is the same sort of error that arises in the following code: 



When processing the instance declaration HOccurs, the compiler 
eagerly applies the correct type simplification rule - the two pa¬ 
rameters of TypeCast must be the same - and infers that two type 
variables e and e' must be the same. The eagerness creates the 
problem. We would like to delay the type simplification until af¬ 
ter the instance HOccurs has been selected and e and e' have been 
instantiated. In other words, we would like to unify the types that 
e and e' are instantiated with, rather than the two type variables 
themselves. 

To keep the compiler from applying the type simplification rule too 
early, we should prevent the early inference of the rule from the 
instance of TypeCast in the first place. For example, we may keep 
the compiler from seeing the instance TypeCast x x until the very 
end. That is, we place that instance in a separate module and import 
it at a higher level in the module hierarchy than all clients of the 
class TypeCast. That was the approach described in Sec. 9. 

We will now give another implementation of TypeCast, which does 
not require separate compilation. It effectively delays the simplifi¬ 
cation step with the help of two auxiliary classes. 

Our new implementation must keep the semantics of the constraint: 
TypeCast a b should hold if and only if the type corresponding 
to a can be unified with the type corresponding to b. On the other 
hand, we need to allow for polymorphism and pretend that in a 
constraint TypeCast a b, b may be something other than a — so 
to keep the typechecker from unifying the type variables a in b in 
occurrences of that constraint. Fortunately, the type system is not 
very smart: when choosing the instances the type-checker looks 
only at the syntactic form of the type terms involved. Therefore, 
to fool the type-checker into thinking that TypeCast a b is more 
polymorphic than it really is, we introduce a series of redirections 
and eventually arrive at the following implementation. 



where typeCast" :: t->a->b 
•ifistftttce TypeCast*'- 0 a b => TypeCast a b 


where typeCast x = typeCast' () x 
istance TypeCa-S-fc" t a b =?> TypeCast' tab 



where typeCast" _ x = x 

The auxiliary classes TypeCast' and TypeCast'' have an extra, 
dummy type parameter, which we instantiate to () in the instances. 
Any other ground type would have sufficed. The key to solving the 
polymorphism quandary is the last instance TypeCast'' () a a. 
It signifies that in the constraint TypeCast" t a b, b is not neces¬ 
sarily a, because t can be something other than (). Semantically, 
though, it can never be anything but. However, the type-checker 
cannot see that and remains satisfied. 

Alas, this implementation is specific to GHC; it does not work 
in Hugs because of the peculiarities of that system with regard to 
multi-parameter type classes and functional dependencies, which 
we briefly hinted at in Sec. 6. That shows that multi-parameter type 
classes with functional dependencies are hard to get right. 

While this code works in GHC and is logically sound, we have 
to admit that we turned the drawbacks of the type-checker to our 
advantage. This leaves a sour after-taste. We would have preferred 
to rely on a sound semantic theory of overloading rather than on 
playing games with the type-checker. Hopefully, the results of the 
foundational work by Sulzmann and others [32, 23] will eventually 
be implemented in all Haskell compilers. 






